Automatic normalization of short texts by combining statistical and rule-based techniques
نویسندگان
چکیده
منابع مشابه
Rule-Based Normalization of Historical Texts
This paper deals with normalization of language data from Early New High German. We describe an unsupervised, rulebased approach which maps historical wordforms to modern wordforms. Rules are specified in the form of context-aware rewrite rules that apply to sequences of characters. They are derived from two aligned versions of the Luther bible and weighted according to their frequency. The eva...
متن کاملCombining Statistical and Rule-Based Approaches to Morphological Tagging of Czech Texts
is article is an extract of the PhD thesis (Spoustová, 2007) and it extends the article (Spoustová et al., 2007). Several hybrid disambiguationmethods are describedwhich combine the strength of hand-written disambiguation rules and statistical taggers. ree different statistical taggers (HMM,Maximum-Entropy and Averaged Perceptron) and a large set of hand-written rules are used in a tagging ex...
متن کاملCombining Rule-Based and Statistical Syntactic Analyzers
This paper presents the results of a set of preliminary experiments combining two knowledge-based partial dependency analyzers with two statistical parsers, applied to the Basque Dependency Treebank. The general idea will be to apply a stacked scheme where the output of the rule-based partial parsers will be given as input to MaltParser and MST, two state of the art statistical parsers. The res...
متن کاملthe role of task-based techniques on the acquisition of english language structures by the intermediate efl students
this study examines the effetivenss of task-based activities in helping students learn english language structures for a better communication. initially, a michigan test was administered to the two groups of 52 students majoring in english at the allameh ghotb -e- ravandi university to ensure their homogeneity. the students scores on the grammar part of this test were also regarded as their pre...
15 صفحه اولCombining Phonology and Morphology for the Normalization of Historical Texts
This paper presents a proposal for the normalization of word-forms in historical texts. To perform this task, we extend our previous research on induction of phonology and adapt it to the task of normalization. In particular, we combine our earlier models with models for learning morphology (without additional supervision). The results are mixed: induction of the segmentation of morphemes fails...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Language Resources and Evaluation
سال: 2012
ISSN: 1574-020X,1574-0218
DOI: 10.1007/s10579-012-9187-y